Human Pose Estimation in Vision Networks Via Distributed Local Processing and Nonparametric Belief Propagation
نویسندگان
چکیده
In this paper we propose a self-initialized method for human pose estimation from multiple cameras. A graphical model for the articulated body is defined through explicit kinematic and structural constraints, which allows for any plausible body configuration and avoids learning the joint distributions from training data. Nonparametric belief propagation (NBP) is used to infer the marginal distributions. However, to address the problem of the inference being trapped in local optima and to achieve fast convergence, a reasonably good pose initialization is required. A bottom-up approach is used to detect body parts distributedly in local processing of each camera. 3D Geometry correspondence relates 2D camera observations spatially to generate a rough pose estimation to initialize node marginal distribution. The marginal distributions are then refined through NBP. Estimated 3D body joint positions are quantitatively analyzed with motion capture data.
منابع مشابه
Graphical models for visual object recognition and tracking
We develop statistical methods which allow effective visual detection, categorization, and tracking of objects in complex scenes. Such computer vision systems must be robust to wide variations in object appearance, the often small size of training databases, and ambiguities induced by articulated or partially occluded objects. Graphical models provide a powerful framework for encoding the stati...
متن کاملDistributed Algorithms for Camera Network Localization and Multiple Target Tracking Using Mobile Robots
Camera networks are gaining popularity due to the information rich sensing modality of cameras. Some important problems arise while migrating from traditional sensor networks to networks that use cameras as their primary sensing modality. One such problem is that of localization. Localization is the process of finding the pose (position and orientation) of objects in a common frame of reference...
متن کاملProbabilistic Pose Recovery Using Learned Hierarchical Object Models
This paper presents a probabilistic representation for 3D objects, and details the mechanism of inferring the pose of real-world objects from vision. Our object model has the form of a hierarchy of increasingly expressive 3D features, and probabilistically represents 3D relations between these. Features at the bottom of the hierarchy are bound to local perceptions; while we currently only use v...
متن کاملSegmentation of objects in a detection window by Nonparametric Inhomogeneous CRFs
This paper presents a method for segmenting objects of a specific class in a given detection window. The task is to label each pixel as belonging to the foreground or the background. We pose the problem as that of finding the maximum a posterior (MAP) estimation in a modified form of Conditional Random Field model that we call a Nonparametric Inhomogeneous CRF (NICRFs). An NICRF, like a convent...
متن کاملLinear State Estimation via 5G C-RAN Cellular Networks using Gaussian Belief Propagation
Machine-type communications and large-scale information processing architectures are among key (r)evolutionary enhancements of emerging fifth-generation (5G) mobile cellular networks. Massive data acquisition and processing will make 5G network an ideal platform for large-scale system monitoring and control with applications in future smart transportation, connected industry, power grids, etc. ...
متن کامل